Encoding models for scholarly literature
نویسندگان
چکیده
In this chapter, the authors examine the issue of digital formats for document encoding, archiving and publishing, through the specific example of “born-digital” scholarly journal articles. This small area of electronic publishing represents a microcosm of the state of the art, and provides a good basis for this discussion. The authors will begin by looking at the traditional workflow of journal editing and publication, and how these practices have made the transition into the online domain. They will examine the range of different file formats in which electronic articles are currently stored and published. They will argue strongly that, despite the prevalence of binary and proprietary formats such as PDF and MS Word, XML is a far superior encoding choice for journal articles. Next, the authors look at the range of XML document structures (DTDs, Schemas) which are in common use for encoding journal articles, and consider some of their strengths and weaknesses. The authors will suggest that, despite the existence of specialized schemas intended specifically for journal articles (such as NLM), and more broadly-used publication-oriented schemas such as DocBook, there are strong arguments in favour of developing a subset or customization of the Text Encoding Initiative (TEI) schema for the purpose of journal-article encoding; TEI is already in use in a number of journal publication projects, and the scale and precision of the TEI tagset makes it particularly appropriate for encoding scholarly articles. They will outline the document structure of a TEI-encoded journal article, and look in detail at suggested markup patterns for specific features of journal articles. Next, they will look briefly at how XML-based publication systems work, and what advantages they bring over electronic publication methods based on other digital formats. DOI: 10.4018/978-1-61692-834-6.ch005
منابع مشابه
Challenges of serials text encoding in the spirit of scholarly communication
Purpose – The paper aims to describe an electronic text project involving a scholarly history journal, and to share findings related to the encoding of serials using the text encoding interchange (TEI) guidelines. Design/methodology/approach – The project was completed using a combination of in-house and outsourced digitization and encoding, employing a variety of methods for quality control an...
متن کاملScholarly communication and bibliometrics: I. Scholarly communication model Literature Review
This article represents a literature review that has two parts. Her is talking about the first part: scholarly communication model. In recent years there has been a resurgence of interest both in scholarly communication as a research area and in the application of bibliometrics as a research method. This article is a compilation of scholarly communication models at the intersection of scholarly...
متن کاملLiterature Growth and the Retrieval System in Scholarly Communication
A series of simple models indicates that literature growth is a prime objective of literature communications systems. The literature communication system is clearly successful in the sciences, and trivia and redundancy are a useful part of its operations. If one treats the literature and its behavior as physically observable phenomena one can base selection and subject analysis on objective cri...
متن کاملAutomatic Construction of Evaluation Sets and Evaluation of Document Similarity Models in Large Scholarly Retrieval Systems
Retrieval systems for scholarly literature offer the ability for the scientific community to search, explore and download scholarly articles across various scientific disciplines. Mostly used by the experts in the particular field, these systems contain user community logs including information on user specific downloaded articles. In this paper we present a novel approach for automatically eva...
متن کاملReceptive Field Encoding Model for Dynamic Natural Vision
Introduction: Encoding models are used to predict human brain activity in response to sensory stimuli. The purpose of these models is to explain how sensory information represent in the brain. Convolutional neural networks trained by images are capable of encoding magnetic resonance imaging data of humans viewing natural images. Considering the hemodynamic response function, these networks are ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- CoRR
دوره abs/0906.0675 شماره
صفحات -
تاریخ انتشار 2009